We propose a logical framework that combines a game-theoretic study of the abilities of agents to
achieve quantitative objectives in multi-player games, by optimizing payoffs or preferences on
outcomes, with a logical analysis of the abilities of players to achieve qualitative objectives,
i.e., reaching or maintaining game states with desired properties. We enrich concurrent game models
with payoffs for the normal form games associated with the states of the model and propose a
quantitative extension of the logic ATL* that enables the combination of quantitative and
qualitative reasoning.



1 Introduction 

There are two rich traditions in studying strategic abilities of agents in multi-player games: 

Game theory has been studying rational behavior of players, relevant for their achievement of quan- 
titative objectives: optimizing payoffs (e.g., maximizing rewards or minimizing cost) or, more generally, 
preferences on outcomes. Usually, the types of games studied in game theory are one-shot normal form 
games, their (finitely or infinitely) repeated versions, and extensive form games. 

Logic has been mostly dealing with strategic abilities of players for achieving qualitative objectives: 
reaching or maintaining outcome states with desired properties, e.g., winning states, or safe states, etc. 

Among the most studied models in the logic tradition are concurrent game models [5, 21]. On the
one hand they are richer than normal form games, as they incorporate a whole family of such games,
each associated with a state of a transition system; on the other hand, they are somewhat poorer,
because the outcomes of each of these normal form games, associated with a given state, are simply
the successor states with their associated games, whereas no payoffs, or even preferences on
outcomes, are assigned. Thus, plays in concurrent game models involve a sequence of possibly
different one-shot normal form games played in succession, and all that is taken into account in
the purely logical framework are the properties, expressed by formulae of a logical language, of
the states occurring in the play. Concurrent game models can also be viewed as a generalization of
(possibly infinite) extensive form games where cycles and simultaneous moves of different players
are allowed, but no payoffs are assigned.

Put as a slogan, the game theory tradition is concerned with how a player can become maximally
rich, or how to pay as little cost as possible, while the logic tradition is concerned with how a
player can achieve a state of 'happiness', e.g. winning, or avoid reaching a state of 'unhappiness'
(losing) in the game.

The most essential technical difference between qualitative and quantitative players' objectives is 
that the former typically refer to (a temporal pattern over) Boolean properties of game states on a given 
play and can be monitored locally whereas the latter are determined by the entire history of the play 
(accumulated payoffs) or even the whole play (its value, being a limit of average payoffs, or of discounted
accumulated payoffs). It is therefore generally computationally more demanding and costly to design
strategies satisfying quantitative objectives or to verify their satisfaction under a given strategy of a 
player or coalition. 

These two traditions have followed rather separate developments, with generally quite different agen- 
das, methods and results, including, inter alia: 

• on the purely qualitative side, logics of games and multiagent systems, such as the Coalition logic
CL [21], the Alternating-time temporal logic ATL [5], and variations of it, see e.g. [15], [18], etc.,
formalizing and studying qualitative reasoning in concurrent game models;

• some single-agent and multi-agent bounded resource logics [9], [3], [19] extending or modifying
concurrent game models with some quantitative aspects by considering cost of agents' actions and
reasoning about what players with bounded resources can achieve;

• extensions of qualitative reasoning (e.g., reachability and Büchi objectives) in multi-player
concurrent games with 'semi-quantitative' aspects by considering a preference preorder on the set
of qualitative objectives, see e.g. [6], Q, thereby adding payoff-maximizing objectives and thus
creating a setting where traditional game-theoretic issues such as game value problems and Nash
equilibria become relevant.

• deterministic or stochastic infinite games on graphs, with qualitative objectives: typically,
reachability, and more generally objectives specified as ω-regular languages over the set of plays, see e.g. El .

main. 

• on the purely quantitative side, first to mention are repeated games, extensively studied in game theory
(see e.g., [20]), which can be naturally treated as simple, one-state concurrent game models with 
accumulating payoffs paid to each player after every round and no qualitative objectives; 

• from a more computational perspective, stochastic games with quantitative objectives on dis- 
counted, mean or total payoffs, in particular energy objectives, see e.g. ifTTl . 

• the conceptually different but technically quite relevant study of counter automata, Petri nets,
vector addition systems, etc., which is essentially a study of the purely quantitative single-agent
case of concurrent game models (see e.g. [14]), where only accumulated payoffs but no qualitative
objectives are taken into account and a typical problem is to decide reachability of payoff
configurations satisfying formally specified arithmetic constraints from a given initial payoff
configuration.

A number of other relevant references discuss the interaction between qualitative and quantitative 
reasoning in multi-player games, e.g. ll22l . ifloTl . which we cannot discuss here due to space limitations. 

This project purports to combine the two agendas in a common logical framework, by enriching 
concurrent game models with payoffs for the one-shot normal form games associated with the states, 
and thus enabling the combination of quantitative game-theoretic reasoning with the qualitative logical 
reasoning. Again, put as a slogan, our framework allows reasoning about whether/how a player can 
reach or maintain a state of 'happiness' while becoming, or remaining, as rich as (rationally) possible, 
or paying the least possible price on the way. The purpose of this extended abstract is to introduce and 
discuss a general framework of models and logics for combined quantitative and qualitative reasoning 
that would naturally cover each of the topics listed above, and to initiate a long term study on it. 

2 Preliminaries 

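A concurrent game model [5] (CGM) $\mathcal{S} = (\mathsf{Ag}, \mathsf{St}, \{\mathsf{Act}_a\}_{a \in \mathsf{Ag}}, \{\mathsf{act}_a\}_{a \in \mathsf{Ag}}, \mathsf{out}, \mathsf{Prop}, \mathsf{L})$ comprises:

• a non-empty, fixed set of players $\mathsf{Ag} = \{1, \ldots, k\}$ and a set of actions $\mathsf{Act}_a \neq \emptyset$ for each $a \in \mathsf{Ag}$.
For any $A \subseteq \mathsf{Ag}$ we will denote $\mathsf{Act}_A := \prod_{a \in A} \mathsf{Act}_a$ and will use $\vec{\alpha}_A$ to denote a tuple from $\mathsf{Act}_A$.
In particular, $\mathsf{Act}_{\mathsf{Ag}}$ is the set of all possible action profiles in $\mathcal{S}$.

• a non-empty set of game states $\mathsf{St}$.

• for each $a \in \mathsf{Ag}$ a map $\mathsf{act}_a : \mathsf{St} \to \mathcal{P}(\mathsf{Act}_a)$ setting for each state $s$ the actions available to $a$ at $s$.

• a transition function $\mathsf{out} : \mathsf{St} \times \mathsf{Act}_{\mathsf{Ag}} \to \mathsf{St}$ that assigns the (deterministic) successor (outcome)
state $\mathsf{out}(q, \vec{\alpha}_{\mathsf{Ag}})$ to every state $q$ and action profile $\vec{\alpha}_{\mathsf{Ag}} = (\alpha_1, \ldots, \alpha_k)$ such that $\alpha_a \in \mathsf{act}_a(q)$
for every $a \in \mathsf{Ag}$ (i.e., every $\alpha_a$ that can be executed by player $a$ in state $q$).

• a set of atomic propositions $\mathsf{Prop}$ and a labelling function $\mathsf{L} : \mathsf{St} \to \mathcal{P}(\mathsf{Prop})$.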

Thus, all players in a CGM execute their actions synchronously and the combination of these actions, 
together with the current state, determines the transition to a (unique) successor state in the CGM. 
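
As a rough illustration of the definition, a finite CGM can be written down as plain data plus a transition function. The following is only a minimal Python sketch: the two players, the states q0 and q1, the actions C and D, the label safe and the transition rule are all invented for illustration and are not taken from the paper.

```python
from itertools import product

# Hypothetical two-player CGM; every name and value here is illustrative.
AGENTS = ("1", "2")
STATES = ("q0", "q1")
# ACT[a][s] plays the role of act_a(s): the actions available to a at s.
ACT = {
    "1": {"q0": ("C", "D"), "q1": ("C",)},
    "2": {"q0": ("C", "D"), "q1": ("C", "D")},
}
# LABEL plays the role of the labelling function L.
LABEL = {"q0": set(), "q1": {"safe"}}


def profiles(state):
    """All action profiles that are executable at `state`."""
    return list(product(*(ACT[a][state] for a in AGENTS)))


def out(state, profile):
    """Deterministic successor of `state` under a full action profile."""
    if profile not in profiles(state):
        raise ValueError(f"profile {profile} is not available at {state}")
    # Illustrative rule: the play is in q1 exactly after the players agree.
    return "q1" if profile[0] == profile[1] else "q0"
```

Here `out` is defined only on profiles whose components are available to the respective players, mirroring the side condition on the transition function.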

The logic of strategic abilities ATL* (Alternating-Time Temporal Logic), introduced and studied in
[5], is a logical system suitable for specifying and verifying qualitative objectives of players and
coalitions in concurrent game models. The main syntactic construct of ATL* is a formula of type
$\langle\!\langle C \rangle\!\rangle \gamma$, intuitively meaning: "The coalition $C$ has a collective strategy to guarantee the
satisfaction of the objective $\gamma$ on every play enabled by that strategy." Formally, ATL* is a
multi-agent extension of the branching time logic CTL*, i.e., a multimodal logic extending the
linear-time temporal logic LTL (comprising the temporal operators X ("at the next state"),
G ("always from now on") and U ("until")) with strategic path quantifiers $\langle\!\langle C \rangle\!\rangle$ indexed with
coalitions $C$ of players. There are two types of formulae of ATL*: state formulae, which constitute
the logic and are evaluated at game states, and path formulae, which are evaluated on game plays.
These are defined by mutual recursion with the following grammars, where $C \subseteq \mathsf{Ag}$ and
$p \in \mathsf{Prop}$: state formulae are defined by
$\varphi ::= p \mid \neg\varphi \mid \varphi \wedge \varphi \mid \langle\!\langle C \rangle\!\rangle \gamma$, and path formulae by
$\gamma ::= \varphi \mid \neg\gamma \mid \gamma \wedge \gamma \mid \mathsf{X}\gamma \mid \mathsf{G}\gamma \mid \gamma \,\mathsf{U}\, \gamma$.

The logic ATL* is very expressive, and that comes at a high computational price: satisfiability and
model checking are 2ExpTime-complete. A computationally better behaved fragment is the logic ATL,
which is the multi-agent analogue of CTL, only involving state formulae defined by the following
grammar, for $C \subseteq \mathsf{Ag}$ and $p \in \mathsf{Prop}$:
$\varphi ::= p \mid \neg\varphi \mid \varphi \wedge \varphi \mid \langle\!\langle C \rangle\!\rangle \mathsf{X}\varphi \mid \langle\!\langle C \rangle\!\rangle \mathsf{G}\varphi \mid \langle\!\langle C \rangle\!\rangle (\varphi \,\mathsf{U}\, \varphi)$.
For this logic satisfiability and model checking are ExpTime-complete and P-complete, respectively.
We will, however, build our extended logical formalism on the richer ATL* because we will
essentially need the path-based semantics for it.
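
To give a feeling for why model checking the fragment ATL is tractable, the sketch below computes the states satisfying $\langle\!\langle C \rangle\!\rangle \mathsf{X}\varphi$ by a one-step pre-image and those satisfying $\langle\!\langle C \rangle\!\rangle \mathsf{G}\varphi$ as a greatest fixpoint of that pre-image. This is the usual fixpoint view of ATL, not an algorithm spelled out in this paper, and the model (player names, states, the rule in `out`) is a hypothetical toy.

```python
from itertools import product


def pre(coalition, target, agents, states, act, out):
    """States from which `coalition` can force the next state into `target`."""
    others = [a for a in agents if a not in coalition]
    winning = set()
    for s in states:
        for mine in product(*(act[a][s] for a in coalition)):
            fixed = dict(zip(coalition, mine))
            # The coalition's joint move must work against every reply.
            if all(
                out(s, {**fixed, **dict(zip(others, theirs))}) in target
                for theirs in product(*(act[a][s] for a in others))
            ):
                winning.add(s)
                break
    return winning


def always(coalition, good, agents, states, act, out):
    """Greatest fixpoint computing the states satisfying <<C>> G good."""
    current = set(good)
    while True:
        refined = current & pre(coalition, current, agents, states, act, out)
        if refined == current:
            return current
        current = refined


# Hypothetical model: player 1 alone decides whether the play stays safe.
AGENTS = ("1", "2")
STATES = ("safe", "bad")
ACT = {a: {s: ("C", "D") for s in STATES} for a in AGENTS}


def out(state, profile):
    return "safe" if state == "safe" and profile["1"] == "C" else "bad"
```

In this toy model player 1 can keep the play in the safe state forever, while player 2 alone cannot even guarantee it for one step.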

Arithmetic Constraints. We define a simple language of arithmetic constraints to express
conditions about the accumulated payoffs of players on a given play. For this purpose, we use a set
$V_{\mathsf{Ag}} = \{v_a \mid a \in \mathsf{Ag}\}$ of special variables to refer to the accumulated payoffs of the players at a given
state and denote by $V_A$ the restriction of $V_{\mathsf{Ag}}$ to any group $A \subseteq \mathsf{Ag}$. The payoffs can be integers,
rationals¹ or any reals. We denote the domain of possible values of the payoffs, assumed to be a subset of
the reals $\mathbb{R}$, by $D$ and use a set of constant symbols $X$, with $0 \in X$, for names of special real values (see
further) to which we want to refer in the logical language.

1 Note that models with rational payoffs behave essentially like models with integer payoffs, after once-off initial re-scaling.

For fixed sets $X$ and $A \subseteq \mathsf{Ag}$ we build the set $T(X, A)$ of terms over $X$ and $A$ from $X \cup V_A$ by applying
addition, e.g. $v_a + v_b$. An evaluation of a term $t \in T(X, A)$ is a mapping $\eta : X \cup V_A \to D$. We write
$\eta \models t$ to denote that $t$ is satisfied under the evaluation $\eta$. Moreover, if some order of the elements of $X \cup V_A$
is clear from context, we also represent an evaluation as a tuple from $D^{|X| + |A|}$ and often assume that
elements from $X$ have their canonic interpretation. The set $AC(X, A)$ of arithmetic constraints over $X$
and $A$ consists of all expressions of the form $t_1 * t_2$ where $* \in \{<, \leq, =, \geq, >\}$ and $t_1, t_2 \in T(X, A)$. We use
$ACF(X, A)$ to refer to the set of Boolean formulae over $AC(X, A)$; e.g. $(t_1 < t_2) \wedge (t_2 > t_3) \in ACF(X, A)$
for $t_1, t_2, t_3 \in T(X, A)$. We note that the language $ACF(X, A)$ is strictly weaker than Presburger arithmetic,
as it involves neither quantifiers nor congruence relations.
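
The constraint language is small enough that an evaluator fits in a few lines. The sketch below is one possible encoding, assuming terms are tuples of symbols combined by addition and formulae are nested tuples; the symbol names `"0"`, `"c3"`, `"v_a"` and `"v_b"` and the tuple layout are illustrative choices, not notation from the paper.

```python
import operator

COMPARE = {
    "<": operator.lt,
    "<=": operator.le,
    "=": operator.eq,
    ">=": operator.ge,
    ">": operator.gt,
}


def term_value(term, eta):
    """A term is a tuple of symbols combined by addition."""
    return sum(eta[symbol] for symbol in term)


def holds(formula, eta):
    """Evaluate a Boolean formula over arithmetic constraints under `eta`.

    Formulas are nested tuples:
      ("ac", t1, op, t2), ("not", f), ("and", f, g).
    """
    tag = formula[0]
    if tag == "ac":
        _, t1, op, t2 = formula
        return COMPARE[op](term_value(t1, eta), term_value(t2, eta))
    if tag == "not":
        return not holds(formula[1], eta)
    if tag == "and":
        return holds(formula[1], eta) and holds(formula[2], eta)
    raise ValueError(f"unknown connective: {tag}")


# Evaluation: constants get their canonic value, variables the current utility.
ETA = {"0": 0, "c3": 3, "v_a": 2, "v_b": -1}
JOINTLY_POSITIVE = ("ac", ("v_a", "v_b"), ">", ("0",))
A_AT_LEAST_3 = ("ac", ("v_a",), ">=", ("c3",))
```

With the evaluation `ETA`, the constraint $v_a + v_b > 0$ holds while $v_a \geq 3$ does not.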

We also consider the set $APC(X, A)$ of arithmetic path constraints, being expressions of the type $w_a * c$
where $a \in \mathsf{Ag}$, $* \in \{<, \leq, =, \geq, >\}$ and $c \in X$. The meaning of $w_a$ is to represent the value of the current
play for the player $a$. That value can be defined differently, typically by computing the accumulated
payoff over the entire play, by using a future discounting factor, or by taking the limit, if it exists, of
the mean (average) accumulated payoff (cf. [20]). We note that the discounted, accumulated, mean or
limit payoffs may take real values beyond the original domain of payoffs $D$; so, we consider the domain
for $X$ to be a suitable closure of $D$.
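
The three ways of valuing a play mentioned above can be compared on a finite prefix. The following sketch is an illustration only: the value of an infinite play is a limit, which a prefix merely approximates, and the index at which discounting starts (`start`, defaulting to 0) is an assumption of this example.

```python
from fractions import Fraction


def accumulated(payoffs):
    """Total payoff collected along a finite prefix of a play."""
    return sum(payoffs)


def discounted(payoffs, d, start=0):
    """Discounted sum: the k-th payoff is weighted by d ** (start + k)."""
    return sum(d ** (start + k) * r for k, r in enumerate(payoffs))


def mean(payoffs):
    """Average payoff per step over a non-empty finite prefix."""
    return Fraction(sum(payoffs), len(payoffs))
```

For integer payoffs 1 and 2 the mean is 3/2, which illustrates why the values referred to by path constraints may fall outside the original payoff domain.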

3 Concurrent Game Models with Payoffs and Guards 

We now extend concurrent game models with utility values for every action profile applied at every state 
and with guards that determine which actions are available to a player at a given configuration, consisting 
of a state and a utility vector, in terms of arithmetic constraints on the utility of that player. 

Definition 1 A guarded CGM with payoffs (GCGMP) is a tuple $\mathfrak{M} = (\mathcal{S}, \mathsf{payoff}, \{g_a\}_{a \in \mathsf{Ag}}, \{d_a\}_{a \in \mathsf{Ag}})$
where $\mathcal{S} = (\mathsf{Ag}, \mathsf{St}, \{\mathsf{Act}_a\}_{a \in \mathsf{Ag}}, \{\mathsf{act}_a\}_{a \in \mathsf{Ag}}, \mathsf{out}, \mathsf{Prop}, \mathsf{L})$ is a CGM and:

• $\mathsf{payoff} : \mathsf{Ag} \times \mathsf{St} \times \mathsf{Act}_{\mathsf{Ag}} \to D$ is a payoff function assigning at every state $s$ and action profile
applied at $s$ a payoff to every agent. We write $\mathsf{payoff}_a(s, \vec{\alpha})$ for $\mathsf{payoff}(a, s, \vec{\alpha})$.

• $g_a : \mathsf{St} \times \mathsf{Act}_a \to ACF(X, \{a\})$, for each player $a \in \mathsf{Ag}$, is a guard function that assigns for each
state $s \in \mathsf{St}$ and action $\alpha \in \mathsf{Act}_a$ an arithmetic constraint formula $g_a(s, \alpha)$ that determines whether
$\alpha$ is available to $a$ at the state $s$ given the current value of $a$'s accumulated payoff. The guard must
enable at least one action for $a$ at $s$. Formally, for each state $s \in \mathsf{St}$, the formula
$\bigvee_{\alpha \in \mathsf{Act}_a} g_a(s, \alpha)$ must be valid. Moreover, a guard $g_a(s, \alpha)$ is called state-based if $g_a(s, \alpha) \in ACF(X)$.

• $d_a \in [0, 1]$ is a discount factor, for each $a \in \mathsf{Ag}$, used in order to define realistically values of infinite
plays for players or to reason about the asymptotic behavior of players' accumulated payoffs.

The guard $g_a$ refines the function $\mathsf{act}_a$ from the definition of a CGM, which can be regarded as a
guard function assigning to every state and action a constant arithmetic constraint true or false. In our
definition the guards assigned by $g_a$ only depend on the current state and the current accumulated payoff
of $a$. The idea is that when the payoffs are interpreted as costs, penalties or, more generally, consumption
of resources, the possible actions of a player would depend on her current availability of utility/resources.

Example 1 Consider the GCGMP shown in Figure [7] with 2 players, I and II, and 3 states, where in
every state each player has 2 possible actions, C (cooperate) and D (defect). The transition function is
depicted in the figure. The normal form games associated with the states are respectively versions of the
Prisoner's Dilemma at state $s_1$, Battle of the Sexes at state $s_2$ and Coordination Game at state $s_3$.

The guards for both players are defined at each state so that the player can apply any action if
she has a positive current accumulated payoff, may only apply action C if she has accumulated payoff
0, and must play an action maximizing her minimum payoff in the current game if she has a negative
accumulated payoff. The discounting factors are 1 and the initial payoffs of both players are 0.

Configurations, plays, and histories. Let $\mathfrak{M}$ be a GCGMP defined as above. A configuration (in
$\mathfrak{M}$) is a pair $(s, \vec{u})$ consisting of a state $s$ and a vector $\vec{u} = (u_1, \ldots, u_k)$ of currently accumulated payoffs,
one for each agent, at that state. Hereafter we refer to accumulated payoffs as utility, at a given state. We
define the set of possible configurations as $\mathsf{Con}(\mathfrak{M}) = \mathsf{St} \times D^{|\mathsf{Ag}|}$. The partial configuration transition
function is defined as $\mathsf{out} : \mathsf{Con}(\mathfrak{M}) \times \mathsf{Act}_{\mathsf{Ag}} \times \mathbb{N} \to \mathsf{Con}(\mathfrak{M})$ such that $\mathsf{out}((s, \vec{u}), \vec{\alpha}, l) = (s', \vec{u}')$ iff:

(i) $\mathsf{out}(s, \vec{\alpha}) = s'$ ($s'$ is a successor of $s$ if $\vec{\alpha}$ is executed).

(ii) assigning the value $u_a$ to $v_a$ satisfies the guard $g_a(s, \alpha_a)$ for each $a \in \mathsf{Ag}$, i.e. $u_a \models g_a(s, \alpha_a)$ (each
agent's move $\alpha_a$ is enabled at $s$ by the respective guard $g_a$ applied to the current accumulated utility
value $u_a$).

(iii) $u'_a = u_a + d_a^{\,l} \cdot \mathsf{payoff}_a(s, \vec{\alpha})$ for all $a \in \mathsf{Ag}$ (i.e., the utility values change according to the utility
function and the discounting rate, where $l$ denotes the number of steps that took place).
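
Conditions (i) to (iii) translate almost directly into code. The sketch below is a minimal rendering under assumptions: configurations are pairs of a state and a dictionary of utilities, guards are passed as a Python predicate rather than as constraint formulae, and the one-state model at the bottom (its payoffs and its guard) is hypothetical.

```python
def step(config, profile, l, out, payoff, guard, discount):
    """One application of the configuration transition function.

    config  : (state, {agent: utility})
    profile : {agent: action}
    l       : number of steps that already took place
    Returns the successor configuration, or None where the partial
    function is undefined because some guard rejects a chosen action.
    """
    state, utility = config
    for agent, action in profile.items():
        if not guard(agent, state, action, utility[agent]):  # condition (ii)
            return None
    successor = out(state, profile)  # condition (i)
    updated = {
        agent: utility[agent] + discount[agent] ** l * payoff(agent, state, profile)
        for agent in utility
    }  # condition (iii)
    return successor, updated


# Hypothetical one-state model used only to exercise `step`.
def out(state, profile):
    return state


def payoff(agent, state, profile):
    return 2 if set(profile.values()) == {"C"} else -1


def guard(agent, state, action, u):
    return action == "C" or u > 0  # D needs a positive balance


DISCOUNT = {"I": 1, "II": 1}
```

Returning `None` for a blocked move is one way to model partiality; raising an exception would serve equally well.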

A GCGMP $\mathfrak{M}$ with a designated initial configuration $(s_0, \vec{u}_0)$ gives rise to a configuration graph on
$\mathfrak{M}$ consisting of all configurations in $\mathfrak{M}$ reachable from $(s_0, \vec{u}_0)$ by the configuration transition function.
A play in a GCGMP $\mathfrak{M}$ is an infinite sequence $\pi = c_0 \vec{\alpha}_0, c_1 \vec{\alpha}_1, \ldots$ from $(\mathsf{Con}(\mathfrak{M}) \times \mathsf{Act})^{\omega}$ such that
$c_n \in \mathsf{out}(c_{n-1}, \vec{\alpha}_{n-1})$ for all $n > 0$. The set of all plays in $\mathfrak{M}$ is denoted by $\mathsf{Plays}_{\mathfrak{M}}$. Given a play $\pi$ we
use $\pi[i]$ and $\pi[i, \infty]$ to refer to the $i$th element and to the subplay starting in position $i$ of $\pi$, respectively.
A history is any finite initial sequence $h = c_0 \vec{\alpha}_0, c_1 \vec{\alpha}_1, \ldots, c_n \in (\mathsf{Con}(\mathfrak{M}) \times \mathsf{Act})^{*} \mathsf{Con}(\mathfrak{M})$ of a play in
$\mathsf{Plays}_{\mathfrak{M}}$. The set of all histories is denoted by $\mathsf{Hist}_{\mathfrak{M}}$. For any history $h$ we also define $h[i]$ as for plays
and additionally $h[last]$ and $h[i, j]$ to refer to the last state on $h$ and to the sub-history between $i$ and $j$,
respectively. Finally, we introduce functions $\cdot^c$, $\cdot^u$, and $\cdot^s$ which denote the projection of a given play
or history to the sequence of its configurations, utility vectors, and states, respectively. For illustration,
let us consider the play $\pi = c_0 \vec{\alpha}_0, c_1 \vec{\alpha}_1, \ldots$. We have that $\pi[i, \infty] = c_i \vec{\alpha}_i, c_{i+1} \vec{\alpha}_{i+1}, \ldots$; $\pi[i] = c_i \vec{\alpha}_i$;
$\pi^c[i, \infty] = c_i, c_{i+1}, \ldots$; $\pi^c[i] = c_i$; $\pi^u[i] = \vec{u}_i$; and $\pi^s[i] = s_i$, where $c_i = (s_i, \vec{u}_i)$.

Example 2 Some possible plays starting from $s_1$ in Example 1 are given in the following, where we
assume that the initial accumulated payoff is 0 for both agents. We note that this implies that the first
action taken by any agent is always C.

1. Both players cooperate forever: $(s_1, 0, 0), (s_1, 2, 2), (s_1, 4, 4), \ldots$

2. After the first round both players defect and the play moves to $s_2$, where player I chooses to defect
whereas II cooperates. Then I must cooperate while II must defect but at the next round can choose
any action, so a possible play is: $(s_1, 0, 0), (s_1, 2, 2), (s_2, 1, 1), (s_2, 0, -1), (s_2, 0, 1), (s_2, 0, 3), (s_2, 0, 5), \ldots$

3. After the first round player I defects while II cooperates and the play moves to $s_3$, where they can
get stuck indefinitely, until, if ever, they happen to coordinate, so a possible play is:
$(s_1, 0, 0), (s_1, 2, 2), (s_3, 5, -2), (s_3, 4, -3), (s_3, 3, -4), \ldots, (s_3, 0, -7), (s_3, -1, -8), \ldots$

Note, however, that once player I reaches accumulated payoff 0 he may only apply C at that round,
so if player II has enough memory or can observe the accumulated payoffs of I he can use the
opportunity to coordinate with I at that round by cooperating, thus escaping the trap at $s_3$ and
making a sure transition to $s_2$.

4. If, however, the guards did not force the players to play C when reaching accumulated payoffs 0,
then both players could plunge into an endless misery if the play reaches $s_3$.
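
The guard described in Example 1 can be made concrete for the game at $s_1$. In the sketch below the payoff table is an assumption: its entries are read off the plays listed in Example 2 (mutual cooperation yields 2 each, mutual defection -1 each, a lone defector gains 3 while the cooperator loses 4) and the game is assumed symmetric, since the figure itself is not reproduced here. The function names are illustrative.

```python
# Player's own payoff in the game at s1, indexed [own action][other's action].
# Values are inferred from the plays in Example 2, assuming a symmetric game.
PD_AT_S1 = {
    "C": {"C": 2, "D": -4},
    "D": {"C": 3, "D": -1},
}


def maximin_actions(own_payoff):
    """Actions maximizing the player's minimum payoff in the current game."""
    worst = {a: min(row.values()) for a, row in own_payoff.items()}
    best = max(worst.values())
    return {a for a, w in worst.items() if w == best}


def enabled(utility, own_payoff):
    """Actions allowed by the guard of Example 1 at a given accumulated payoff."""
    if utility > 0:
        return set(own_payoff)          # any action
    if utility == 0:
        return {"C"}                    # only cooperation
    return maximin_actions(own_payoff)  # negative balance: play safe
```

Under these assumed payoffs a player with a negative balance at $s_1$ would be forced to defect, because defection has the better worst case.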

Strategies. A strategy of a player $a$ is a function $s_a : \mathsf{Hist} \to \mathsf{Act}$ such that if $s_a(h) = \alpha$ then
$h^u[last]_a \models g_a(h^s[last], \alpha)$; that is, actions prescribed by a strategy must be enabled by the guard. Our
definition of strategy is based on histories of configurations and actions, so it extends the notion of
strategy from [5], where it is defined on histories of states, and includes strategies typically considered
e.g. in the study of repeated games, where strategies often prescribe to the player an action dependent on
the previous action, or history of actions, of the other player(s). Such are, for instance, Tit-for-Tat
or Grim-trigger in the repeated Prisoner's Dilemma; likewise for various card games, etc. Since our
notion of strategy is very general, it easily leads to undecidable model checking problems. So, we also
consider some natural restrictions, such as: state-based, action-based or configuration-based,
memoryless, bounded memory, or perfect recall strategies². Here we adopt a generic approach and assume
that two classes of strategies $\mathcal{S}^p$ and $\mathcal{S}^o$ are fixed as parameters, with respect to which the proponents
and opponents select their strategies, respectively. The proponent coalition $A$ selects an $\mathcal{S}^p$-strategy $s_A$
(i.e. one agreeing with the class $\mathcal{S}^p$) while the opponent coalition $\mathsf{Ag} \setminus A$ selects an $\mathcal{S}^o$-strategy $s_{\mathsf{Ag} \setminus A}$.
The outcome play $\mathsf{outcome\_play}_{\mathfrak{M}}(c, (s_A, s_{\mathsf{Ag} \setminus A}), l)$ in a given GCGMP $\mathfrak{M}$ determines the play emerging
from the execution of the (complete) strategy profile $(s_A, s_{\mathsf{Ag} \setminus A})$ from configuration $c$ in $\mathfrak{M}$.
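
A strategy that reacts to the opponent's previous action needs exactly the kind of history used here, one that records actions as well as configurations. The sketch below illustrates this with Tit-for-Tat in a hypothetical one-state repeated Prisoner's Dilemma. It makes several simplifying assumptions: guards are ignored, discount factors are 1, a configuration is reduced to the utility dictionary because there is only one state, and the payoff values are illustrative.

```python
# Own payoff in a hypothetical one-state repeated Prisoner's Dilemma,
# indexed [own action][other's action]; guards are ignored in this sketch.
PAYOFF = {"C": {"C": 2, "D": -4}, "D": {"C": 3, "D": -1}}
OTHER = {"I": "II", "II": "I"}


def tit_for_tat(me):
    """Cooperate first, then repeat the opponent's previous action."""
    def strategy(history):
        if len(history) == 1:            # only the initial configuration
            return "C"
        _, last_profile = history[-2]    # last completed round
        return last_profile[OTHER[me]]
    return strategy


def always_defect(history):
    return "D"


def outcome_prefix(utility, strategies, rounds):
    """First `rounds` steps of the play induced by a strategy profile.

    A history is a list [(c_0, a_0), ..., (c_{n-1}, a_{n-1}), c_n] where each
    c_i is a utility dictionary and each a_i an action profile.
    """
    history = [dict(utility)]
    for _ in range(rounds):
        profile = {a: s(history) for a, s in strategies.items()}
        current = history[-1]
        nxt = {a: current[a] + PAYOFF[profile[a]][profile[OTHER[a]]] for a in current}
        history[-1] = (current, profile)
        history.append(nxt)
    return history
```

A strategy defined only on sequences of states could not express Tit-for-Tat in this one-state model, since every state history looks the same.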

4 The Logic: Quantitative ATL* 

We now extend the logic ATL* to the logic QATL* with atomic quantitative objectives being state or path
arithmetic constraints over the players' accumulated payoffs. The semantics of QATL* naturally extends
the semantics of ATL* over GCGMPs, but is parameterised with the two classes of strategies $\mathcal{S}^p$ and $\mathcal{S}^o$.

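Definition 2 (The logic QATL*) The language of QATL* consists of state formulae $\varphi$, which constitute
the logic, and path formulae $\gamma$ generated as follows, where $A \subseteq \mathsf{Ag}$, $ac \in AC$, $apc \in APC$, and $p \in \mathsf{Prop}$:
$\varphi ::= p \mid ac \mid \neg\varphi \mid \varphi \wedge \varphi \mid \langle\!\langle A \rangle\!\rangle \gamma$ and $\gamma ::= \varphi \mid apc \mid \neg\gamma \mid \gamma \wedge \gamma \mid \mathsf{X}\gamma \mid \mathsf{G}\gamma \mid \gamma \,\mathsf{U}\, \gamma$.

Let $\mathfrak{M}$ be a GCGMP, $c$ a configuration, $\varphi, \varphi_1, \varphi_2$ state formulae, $\gamma, \gamma_1, \gamma_2$ path formulae, and $l \in \mathbb{N}$.
Further, let $\mathcal{S}^p$ and $\mathcal{S}^o$ be two classes of strategies as described above. The semantics of the
path constraints is specified according to the limit-averaging or discounting mechanism adopted for
computing the value of a play for a player. Then the truth of a QATL* formula at a position of a
configuration in $\mathfrak{M}$ is defined by mutual recursion on state and path formulae as follows:

$\mathfrak{M}, c, l \models p$ for $p \in \mathsf{Prop}$ iff $p \in \mathsf{L}(c^s)$;

$\mathfrak{M}, c, l \models ac$ for $ac \in AC$ iff $c^u \models ac$;

$\mathfrak{M}, c, l \models \langle\!\langle A \rangle\!\rangle \gamma$ iff there is a collective $\mathcal{S}^p$-strategy $s_A$ for $A$ such that for all collective $\mathcal{S}^o$-strategies
$s_{\mathsf{Ag} \setminus A}$ for $\mathsf{Ag} \setminus A$ we have that $\mathfrak{M}, \mathsf{outcome\_play}_{\mathfrak{M}}(c, (s_A, s_{\mathsf{Ag} \setminus A}), l), l \models \gamma$;

$\mathfrak{M}, \pi, l \models \varphi$ iff $\mathfrak{M}, \pi[0], l \models \varphi$;

$\mathfrak{M}, \pi, l \models apc$ for $apc \in APC$ iff $\pi^u, l \models apc$;

$\mathfrak{M}, \pi, l \models \mathsf{G}\gamma$ iff $\mathfrak{M}, \pi[i, \infty], l + i \models \gamma$ for all $i \in \mathbb{N}$;

$\mathfrak{M}, \pi, l \models \mathsf{X}\gamma$ iff $\mathfrak{M}, \pi[1, \infty], l + 1 \models \gamma$;

$\mathfrak{M}, \pi, l \models \gamma_1 \,\mathsf{U}\, \gamma_2$ iff there is $j \in \mathbb{N}_0$ such that $\mathfrak{M}, \pi[j, \infty], l + j \models \gamma_2$ and $\mathfrak{M}, \pi[i, \infty], l + i \models \gamma_1$ for all $0 \leq i < j$.

Ultimately, we define $\mathfrak{M}, c \models \varphi$ as $\mathfrak{M}, c, 1 \models \varphi$. Moreover, if not clear from context, we also write
$\models_{(\mathcal{S}^p, \mathcal{S}^o)}$ for $\models$.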

2 We note that all strategies need to be consistent with the guards, so state-based strategies are only applicable in models
where the guards only take into account the current state, but not the accumulated payoffs.

The semantics presented above extends the standard semantics for ATL* and is amenable to various
refinements and restrictions, to be studied further. For instance, if appropriate, an alternative semantics
can be adopted, based on irrevocable strategies HI or, more generally, on strategy contexts (H or other
mechanisms for strategy commitment and release [2]. Also, the nested operators as defined here access
the accumulated utility values and require plays to be infinite. Similarly to [9], one can consider variants
of these settings which may yield decidable model checking and better complexity results.

As the logic QATL* extends ATL*, it allows expressing all purely qualitative ATL* properties. It
can also express purely quantitative properties, e.g.: $\langle\!\langle \{a\} \rangle\!\rangle \mathsf{G}(v_a > 0)$ meaning "Player $a$ has a strategy
to maintain his accumulated payoff to be always positive", or $\langle\!\langle A \rangle\!\rangle (w_a \geq 3)$ meaning "The coalition $A$
has a strategy that guarantees the value of the play for player $a$ to be at least 3". Moreover, QATL* can
naturally express combined qualitative and quantitative properties, e.g. $\langle\!\langle \{a, b\} \rangle\!\rangle ((v_a + v_b > 1) \,\mathsf{U}\, p)$, etc.

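Example 3 The following QATL* state formulae are true at state $s_1$ of the GCGMP in Example 1, where
$p_i$ is an atomic proposition true only at state $s_i$, for each $i = 1, 2, 3$:

(i) $\langle\!\langle \{I, II\} \rangle\!\rangle \mathsf{F}(p_1 \wedge v_I > 100 \wedge v_{II} > 100) \wedge \langle\!\langle \{I, II\} \rangle\!\rangle \mathsf{X}\mathsf{X} \langle\!\langle \{II\} \rangle\!\rangle (\mathsf{G}(p_2 \wedge v_I = 0) \wedge \mathsf{F}\, v_{II} > 100)$.

(ii) $\neg \langle\!\langle \{I\} \rangle\!\rangle \mathsf{G}(p_1 \vee v_I > 0) \wedge \neg \langle\!\langle \{I, II\} \rangle\!\rangle \mathsf{F}(p_3 \wedge \mathsf{G}(p_3 \wedge (v_I + v_{II} > 0)))$.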

5 (Un)Decidability: Related Work and Some Preliminary Results 

Generally, the GCGMP models are too rich and the language of QATL* is too expressive to expect
computational efficiency, or even decidability, of either model checking or satisfiability testing. Some
preliminary results and related work show that model checking of QATL* in GCGMPs is undecidable
under rather weak assumptions, e.g. if the proponents or the opponents can use memory-based strategies.
These undecidability results are not surprising, as GCGMPs are closely related to Petri nets and vector
addition systems and it is known that model checking over them is generally undecidable. In [13], for
example, this is shown for fragments of CTL and (state-based) LTL over Petri nets. Essentially, the
reason is that the logics allow encoding a "test for zero"; for Petri nets this means checking whether
a place contains a token or not. In our setting undecidability follows for the same reason, and we will
sketch some results below.

Undecidability results. The logic QATL restricts QATL* in the same way as ATL restricts ATL*; due
to lack of space we skip the formal definition. As a first result we show that model checking QATL is
undecidable even if only the proponents are permitted to use perfect recall strategies and the opponents
are bound to memoryless strategies. More formally, let $S^{pr}$ denote the class of perfect recall state-based
strategies and $S^{m}$ the class of memoryless state-based strategies. That is, strategies of the former class
are functions of type $\mathsf{St}^{*} \to \mathsf{Act}$ and of the latter class functions of type $\mathsf{St} \to \mathsf{Act}$.

Undecidability can be shown using ideas from e.g. [9], [13]. Here, we make use of the construction
of [9] to illustrate the undecidability by simulating a two-counter machine (TCM). A TCM ffTTl can
be considered as a transition system equipped with two integer counters that enable/disable transitions.
Each step of the machine depends on the current state, symbol on the tape, and the counters, whether
they are zero or not. After each step the counters can be incremented ($+1$) or decremented ($-1$), the
latter only if the respective counter is not zero. A TCM is essentially a (nondeterministic) push-down
automaton with two stacks and exactly two stack symbols (one of them is the initial stack symbol) and
has the same computation power as a Turing machine (cf. iTTTl ). A configuration is a triple $(s, w_1, w_2)$
describing the current state ($s$), the value of counter 1 ($w_1$) and of counter 2 ($w_2$). A computation $\delta$ is a
sequence of subsequent configurations effected by transitions.

For the simulation, we associate each counter with a player. The player's accumulated payoff encodes
the counter value; actions model the increment/decrement of the counters; guards ensure that the actions 
respect the state of the counters. The accepting states of the two-counter machine are encoded by a special 
proposition halt. Now, the following lemma stating the soundness of the simulation can be proved: 

Lemma 1 (Reduction) For any two-counter machine $A$ we can construct a finite GCGMP $\mathfrak{M}^A$ with two
players and proposition halt such that the following holds: $A$ halts on the empty input iff $\mathfrak{M}^A$ contains a
play $\pi$ with $\pi^c = (s^0, (v_1^0, v_2^0)), (s^1, (v_1^1, v_2^1)), \ldots$ such that there exists $j \in \mathbb{N}$ with $\mathsf{halt} \in \mathsf{L}(s^j)$.

The next theorem gives two cases for which the model checking problem is undecidable. By the
previous Lemma we have to ensure that the halting state is reached, which can be expressed by $\langle\!\langle 1 \rangle\!\rangle \mathsf{F}\, \mathsf{halt}$.
We can also use purely state-based guards and encode the consistency checks in the formula as follows:
$\langle\!\langle 1 \rangle\!\rangle (v_1 \geq 0 \wedge v_2 \geq 0 \wedge (e_1 \to v_1 = 0) \wedge (e_2 \to v_2 = 0)) \,\mathsf{U}\, \mathsf{halt}$, where the proposition $e_i$ is added to the model
to indicate that the value of counter $i$ is zero. Note that this information is static and obtained from the
transition relation of the automaton.

Proposition 1 Model checking the logic QATL is undecidable, even for the 2 agent case and no nested
cooperation modalities, where $\mathcal{S}^p = S^{pr}$ and $\mathcal{S}^o = S^{m}$. This holds even when restricted either to
formulae not involving arithmetic constraints or to state-based guards.
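
The bookkeeping behind the simulation can be illustrated by running a two-counter program while treating the counters as accumulated payoffs: an increment is a payoff of +1, a decrement a payoff of -1, and the zero test is what the guards (or the propositions $e_i$) have to enforce. The sketch below is not the construction of [9]; the instruction format, the function name `run_tcm` and the sample program are hypothetical, and a step bound is needed because halting cannot be decided in general.

```python
def run_tcm(program, max_steps):
    """Run a two-counter program, reading the counters as accumulated payoffs.

    Hypothetical instruction formats:
      ("inc", i, nxt)             counter i receives payoff +1
      ("dec", i, nxt_pos, nxt_0)  payoff -1 if counter i > 0, else zero branch
      ("halt",)
    Returns (location, counters) on halting, or None if the step bound is hit.
    """
    location, counters = 0, [0, 0]
    for _ in range(max_steps):
        instruction = program[location]
        if instruction[0] == "halt":
            return location, tuple(counters)
        if instruction[0] == "inc":
            _, i, nxt = instruction
            counters[i] += 1
            location = nxt
        else:
            _, i, nxt_pos, nxt_zero = instruction
            if counters[i] > 0:      # guard v_i > 0 enables the decrement
                counters[i] -= 1
                location = nxt_pos
            else:                    # guard v_i = 0 enables the zero branch
                location = nxt_zero
    return None


# Moves the value 2 from counter 0 to counter 1, then halts.
PROGRAM = {
    0: ("inc", 0, 1),
    1: ("inc", 0, 2),
    2: ("dec", 0, 3, 4),
    3: ("inc", 1, 2),
    4: ("halt",),
}
```

The two branches of the `dec` instruction are exactly the "test for zero" that makes model checking undecidable once negative payoffs are allowed.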

Restoring decidability. There are some natural semantic and syntactic restrictions of QATL* where 
decidability may be restored; these include for instance, the enabling of only memoryless strategies, 
imposing non-negative payoffs, constraints on the transition graph of the model, bounds on players'
utilities, etc. For instance, the main reason for the undecidability result above is the possibility for negative
payoffs that allow for decrementing the accumulated payoffs and thus simulating the TCM operations. 
Therefore, a natural restriction in the quest for restoring decidability is to consider only GCGMP models 
with non-negative payoffs. In this case the accumulated payoffs increase monotonically over every play 
of the game, and therefore the truth values of every arithmetic constraint occurring in the guards and in 
the formula eventually stabilize in a computable way, which in the long run reduces the model checking 
of any QATL-formula in a GCGMP to a model checking of an ATL-formula in a CGM. One can thus
obtain decidability of the model checking of the logic QATL in finite GCGMP with non-negative payoffs 
and perfect information. We will discuss these and other decidability results in a future work, where we 
will also consider restrictions similar to 0. 
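
The stabilization argument can be checked on small examples. The sketch below covers only the simplest case, a single accumulated payoff compared with a constant: with non-negative payoffs the value never decreases, so such a comparison can change its truth value at most twice along a play (and at most once unless the comparison is an equality). The function name and the encoding of comparisons are assumptions of this illustration.

```python
import operator

COMPARE = {"<": operator.lt, "<=": operator.le, "=": operator.eq,
           ">=": operator.ge, ">": operator.gt}


def truth_changes(payoffs, op, bound, start=0):
    """Count how often `v op bound` changes truth value along a play prefix,
    where v starts at `start` and grows by the given non-negative payoffs."""
    if any(r < 0 for r in payoffs):
        raise ValueError("this sketch assumes non-negative payoffs")
    value = start
    truth = [COMPARE[op](value, bound)]
    for r in payoffs:
        value += r
        truth.append(COMPARE[op](value, bound))
    return sum(1 for before, after in zip(truth, truth[1:]) if before != after)
```

With negative payoffs no such bound exists, which is what the counter-machine simulation exploits.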

6 Concluding Remarks 

This paper proposes a long-term research agenda bringing together issues, techniques and results from 
several research fields. It aims at bridging the two important aspects of reasoning about objectives and 
abilities of players in multi-player games: quantitative and qualitative, and eventually providing a
uniform framework for strategic reasoning in multi-agent systems.